MongoDB-Hadoop Distributed and Scalable Framework for Spatio-Temporal Hazardous Materials Data Warehousing
نویسندگان
چکیده
Today the demand for Carriage of Dangerous Goods is experiencing significant increase on Moroccan market. Each year, more than 15 million tons of dangerous goods are transported by road in Morocco. The transport of dangerous goods is regulated by a legal framework in line with international standards; including the European Agreement concerning the International Carriage of Dangerous Goods by Road (ADR) came into effect in Morocco in June 2003. With the aim to facilitate the deployment of some ADR guidelines, this project offers essential IT solutions for its application at the regional and national scale. In this context, the project involves development of software components for calculating safer time-dependent routes, spatial analysis of Voronoi network and establishment of a decisional database to capture HAZMAT (Hazardous Materials) shipments and occurring incidents or accidents. The framework that we propose assumes three major software components. The first is dedicated to the processing of time-dependent routes that considers risk and traffic conditions. The second is developing the transport network partitioning using Voronoi graph diagrams. This component is used for the purposes of management interventions. Finally, the last component provides a NoSQL database for the storage of HAZMAT events and shipping data. Other supports components are provided for collecting and visualizing of data and spatio-temporal events related to HAZMAT. All given components are integrated into an interoperable software infrastructure respecting intelligent transport systems architecture. This infrastructure is distributed and based on a service-oriented architecture. It is also scalable by integration of MongoDB with Hadoop for largescale distributed data processing. In this work, we also give an assessment of the performance, scalability and fault-tolerance of using MongoDB with Hadoop, towards the goal of identifying the right architecture and software environment for HAZMAT spatio-temporal data analytics.
منابع مشابه
ST-Hadoop: A MapReduce Framework for Spatio-Temporal Data
This paper presents ST-Hadoop; the first full-fledged opensource MapReduce framework with a native support for spatio-temporal data. ST-Hadoop is a comprehensive extension to Hadoop and SpatialHadoop that injects spatio-temporal data awareness inside each of their layers, mainly, language, indexing, and operations layers. In the language layer, ST-Hadoop provides built in spatio-temporal data t...
متن کاملA Novel Spatio-Temporal Data Storage and Index Method for ARM-Based Hadoop Server
During the past decade, a vast number of GPS devices have produced massive amounts of data containing both time and spatial information. This poses a great challenge for traditional spatial databases. With the development of distributed cloud computing, many highperformance cloud platforms have been built, which can be used to process such spatio-temporal data. In this research, to store and pr...
متن کاملAdaptive Dynamic Data Placement Algorithm for Hadoop in Heterogeneous Environments
Hadoop MapReduce framework is an important distributed processing model for large-scale data intensive applications. The current Hadoop and the existing Hadoop distributed file system’s rack-aware data placement strategy in MapReduce in the homogeneous Hadoop cluster assume that each node in a cluster has the same computing capacity and a same workload is assigned to each node. Default Hadoop d...
متن کاملAn Efficient Design and Implementation of an MdbULPS in a Cloud-Computing Environment
Flexibly expanding the storage capacity required to process a large amount of rapidly increasing unstructured log data is difficult in a conventional computing environment. In addition, implementing a log processing system providing features that categorize and analyze unstructured log data is extremely difficult. To overcome such limitations, we propose and design a MongoDB-based unstructured ...
متن کاملWarehousing and OLAPing Complex, Spatial and Spatio-Temporal Data
Preface Complex, spatial and spatio-temporal data arise in a plethora of modern database and data mining applications and complex information systems. Complex, spatial and spatio-temporal data require more and more for effective and efficient models, algorithms and techniques for representing, managing, querying , indexing and discovering useful knowledge beyond such kind of data. A successful ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014